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IDENTIFICATION OF BROADLY REACTIVE DR RESTRICTED EPITOPES 



CROSS-REFERENCE TO RELATED APPLICATIONS 

5 The present application is a continuation in part of USSN 60/036,713, filed 

January 23, 1997 and 60/037,432 filed February 7, 1997, both of which are incorporated 
herein by reference. 

BACKGROUND OF THE INVENTION 

10 Helper T lymphocytes (HTL) play several important functions in immunity 

to pathogens. Firstly, they provide help for induction of both CTL and antibody 

responses. By both direct contact and by secreting lymphokines such as IL2 and IL4, 

HTL promote and support the expansion and differentiation of T and B cell precursors into 

effector cells. In addition, HTL can also be effectors in their own right, an activity also 
15 mediated by direct cell contact and secretion of lymphokines, such as IFNy and TNFa. 

HTL have been shown to have direct effector activity in case of tumors, as well as viral, 

bacterial, parasitic, and fimgal infections. 

HTL recognize a complex formed between Class II MHC molecules and 

antigenic peptides, usually between 10 and 20 residues long, and with an average size of 
20 between 13 and 16 amino acids. Peptide-Class II interactions have been analyzed in 

detail, both at the structural and functional level, and peptide motifs specific for various 

human and mouse Class II molecules have been proposed. 

In the last few years, epitope based vaccines have received considerable 

attention as a possible mean to develop novel prophylactic vaccines and immunotherapeutic 
25 strategies. Selection of appropriate T and B cell epitopes should allow to focus the 

immune system toward conserved epitopes of pathogens which are characterized by high 

sequence variability (such as HIV, HCV and Malaria). 

In addition, focusing the immune response towards selected determinants 

could be of value in the case of various chronic viral diseases and cancer, where T cells 
30 directed against the immunodominant epitopes might have been inactivated while T cells 

specific for subdominant epitopes might have escaped T cell tolerance. The use of epitope 
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based vaccines also allows to avoid "suppressive" T cell determinants which induce TH 2 
responses, in conditions where a response is desirable, or vice versa. 

Finally, epitope based vaccines also offer the opportunity to include in the 
vaccine construct epitopes that have been engineered to modulate their potency, either by 
increasing MHC binding affinity, or by alteration of its TCR contact residues, or both. 
Inclusion of completely synthetic non-natural or generically unrelated to the pathogen 
epitopes (such as TT derived "universal" epitopes), also represents a possible mean of 
modulating the HTL response toward a THj, or TH 2 phenotype. 

Once appropriate epitope determinants have been defined, they can be 
assorted and delivered by various means, which include lipopeptides, viral delivery 
vectors, particles of viral or synthetic origin, naked or particle absorbed cDNA. 

However, before appropriate epitopes can be defined, one major obstacle 
has to be overcome, namely the very high degree of polymorphism of the MHC molecules 
expressed in the human population. In fact, more than two hundred different types of 
HLA Class I and Class II molecules have already been identified. It has been 
demonstrated that in the case of HLA Class I molecules, peptides capable of binding 
several different HLA Class I molecules can be identified. Over 60% of the known HLA 
Class I molecules can, in fact, be grouped in four broad HLA supertypes, characterized by 
similar peptide binding specificities (HLA supermotifs). 

In the case of Class III molecules, it is also known that peptides capable of 
binding multiple HLA types and of being immunogenic in the context of different HLA 
molecules do indeed exist. Until now, however, a general method for their identification 
has not been developed, probably at least in part a reflection of the fact that quantitative 
DR binding assays are labor intensive and that a large number of alleles must to be 
considered. 

The present invention addresses these and other needs. 
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SUMMARY OF THE INVENTION 

The present invention is based, at least in part, on the discovery and 
validation of specific motifs and assay systems for various DR molecules, representative of 
the worldwide population. Their application to the identification of broadly degenerate 
HLA Class II binding peptides is also described. 

Definitions 

The term "peptide" is used interchangeably with "oligopeptide" in the 
present specification to designate a series of residues, typically L-amino acids, connected 
one to the other typically by peptide bonds between the alpha-amino and carbonyl groups 
of adjacent amino acids. The oligopeptides of the invention are less than about 50 residues 
in length and usually consist of between about 10 and about 30 residues, more usually 
between about 12 and 25, and often between about 15 and about 20 residues. 

An "immunogenic peptide" is a peptide which comprises an allele-specific 
motif such that the peptide will bind an MHC molecule and induce a HTL response. 
Immunogenic peptides of the invention are capable of binding to an appropriate HLA 
molecule and inducing HTL response against the antigen from which the immunogenic 
peptide is derived. 

A "conserved residue" is a conserved amino acid occupying a particular 
position in a peptide motif typically one where the MHC structure may provide a contact 
point with the immunogenic peptide. One to three, typically two, conserved residues 
within a peptide of defined length defines a motif for an immunogenic peptide. These 
residues are typically in close contact with the peptide binding groove, with their side 
chains buried in specific pockets of the groove itself. 

The term "motif" refers to the pattern of residues of defined length, usually 
between about 8 to about 11 amino acids, which is recognized by a particular MHC allele. 

The term "supermotif " refers to motifs that, when present in an 
immunogenic peptide, allow the peptide to bind more than one HLA antigen. The 
supermotif preferably is recognized by at least one HLA allele having a wide distribution 
in the human population, preferably recognized by at least two alleles, more preferably 
recognized by at least three alleles, and most preferably recognized by more than three 
alleles. 
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The phrases "isolated" or "biologically pure" refer to material which is 
substantially or essentially free from components which normally accompany it as found in 
its native state. Thus, the peptides of this invention do not contain materials normally 
associated with their in situ environment, e.g., MHC I molecules on antigen presenting 
5 cells. Even where a protein has been isolated to a homogenous or dominant band, there 
are trace contaminants in the range of 5-10% of native protein which co-purify with the 
desired protein. Isolated peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino acid mimetic 
10 incorporated in an oligopeptide by an amide bond or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR4w4 binding capacity when occupying a particular 
15 position, relative to the main P1-P6 anchors. 

Figure 2A presents a map of the positive or negative effect of each of the 20 
naturally occurring amino acids on DR1 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

Figure 2B presents a map of the positive or negative effect of each of the 20 
20 naturally occurring amino acids on DR7 binding capacity when occupying a particular 
position, relative to the main P1-P6 anchors. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The present invention relates to compositions and methods for preventing, 
25 treating or diagnosing a number of pathological states such as viral, fungal, bacterial and 
parasitic diseases and cancers. In particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) class II molecules and inducing an 
immune response. 

Peptide binding to MHC molecules is determined by the allelic type of the 
30 MHC molecule and the amino acid sequence of the peptide. MHC class I-binding peptides 
usually contain within their sequence two conserved ("anchor") residues that interact with 
corresponding binding pockets in the MHC molecule. Specific combination of anchor 
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residues (usually referred to as "MHC motifs") required for binding by several allelic 
forms of human MHC (HLA, histocompatibility leukocyte antigens) are described in 
International Applications WO 94/03205 and WO 94/20127. Definition of specific MHC 
motifs allows one to predict from the amino acid sequence of an individual protein, which 
5 peptides have the potential of being immunogenic for CTL. These applications describe 
methods for preparation and use of immunogenic peptides in the treatment of disease. The 
peptides described here can also be used as helper T peptides in combination with peptides 
which induce a CTL response. This is described in WO 95/07077. 

The DR-binding peptides of the present invention or nucleic acids encoding 

10 them can be administered to mammals, particularly humans, for prophylactic and/or 

therapeutic purposes. The DR peptides can be used to enhance immune responses against 
other immunogens administered with the peptides, when the peptides of the invention are 
used as helper peptides. For instance, mixtures of peptides of the invention in 
combination with peptides that induce CTL responses may be used to treat and/or prevent 

15 viral infection and cancer. Alternatively, immunogens which induce antibody responses 
can be used. Examples of diseases which can be treated using the immunogenic mixtures 
of DR peptides and other immunogens include prostate cancer, hepatitis B, hepatitis C, 
AIDS, renal carcinoma, cervical carcinoma, lymphoma, CMV and condyloma 
acuminatum. 

20 The DR-binding peptides or nucleic acids encoding them may also be used 

to treat a variety of conditions involving unwanted T cell reactivity. Examples of diseases 
which can be treated using DR-binding peptides include autoimmune diseases (e.g., 
rheumatoid arthritis, multiple sclerosis, and myasthenia gravis), allograft rejection, 
allergies (e.g., pollen allergies), lyme disease, hepatitis, LCMV, post-streptococcal 

25 endocarditis, or glomerulonephritis, and food hypersensitivities. 

In therapeutic applications, the immunogenic compositions or the DR- 
binding peptides or nucleic acids of the invention are administered to an individual already 
suffering from cancer, autoimmune disease, or infected with the virus of interest. Those 
in the incubation phase or the acute phase of the disease may be treated with the DR- 

30 binding peptides or immunogenic conjugates separately or in conjunction with other 
treatments, as appropriate. 
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In therapeutic applications, compositions comprising immunogenic 
compositions are administered to a patient in an amount sufficient to elicit an effective 
immune response to the virus or tumor antigen and to cure or at least partially arrest 
symptoms and/or complications. Similarly, compositions comprising DR-binding peptides 

5 are administered in an amount sufficient to cure or at least partially arrest the symptoms of 
the disease and its complications. An amount adequate to accomplish this is defined as 
"therapeutically effective dose." Amounts effective for this use will depend on, e.g., the 
peptide composition, the manner of administration, the stage and severity of the disease 
being treated, the weight and general state of health of the patient, and the judgment of the 

10 prescribing physician. 

Therapeutically effective amounts of the immunogenic compositions of the 
present invention generally range for the initial immunization (that is for therapeutic or 
prophylactic administration) from about 1.0 jig to about 10,000 fig of peptide for a 70 kg 
patient, usually from about 100 to about 8000 fig, and preferably between about 200 and 

15 about 6000 ftg. These doses are followed by boosting dosages of from about 1.0 /ig to 

about 1000 fig of peptide pursuant to a boosting regimen over weeks to months depending 
upon the patient's response and condition by measuring specific immunogenic activity in 
the patient's blood. 

It must be kept in mind that the compositions of the present invention may 

20 generally be employed in serious disease states, that is, life-threatening or potentially life- 
threatening situations. In such cases, in view of the minimization of extraneous substances 
and the relative nontoxic nature of the conjugates, it is possible and may be felt desirable 
by the treating physician to administer substantial excesses of these compositions. 

For prophylactic use, administration should be given to risk groups. For 

25 example, protection against malaria, hepatitis, or AIDS may be accomplished by 

prophylactically administering compositions of the invention, thereby increasing immune 
capacity. Therapeutic administration may begin at the first sign of disease or the detection 
or surgical removal of tumors or shortly after diagnosis in the case of acute infection. 
This is followed by boosting doses until at least symptoms are substantially abated and for 

30 a period thereafter. In chronic infection, loading doses followed by boosting doses may be 
required. 
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Treatment of an infected individual with the compositions of the invention 
may hasten resolution of the infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic infection the compositions 
are particularly useful in methods for preventing the evolution from acute to chronic 
infection. Where the susceptible individuals are identified prior to or during infection, for 
instance, as described herein, the composition can be targeted to them, minimizing need 
for administration to a larger population. 

The peptide mixtures or conjugates can also be used for the treatment of 
chronic infection and to stimulate the immune system to eliminate virus-infected cells in 
carriers. It is important to provide an amount of immuno-potentiating peptide in a 
formulation and mode of administration sufficient to effectively stimulate a cytotoxic T 
cell response. Thus, for treatment of chronic infection, a representative dose is in the 
range of about 1.0 fig to about 5000 /tg, preferably about 5 jig to 1000 jig for a 70 kg 
patient per dose. Immunizing doses followed by boosting doses at established intervals, 
e.g., from one to four weeks, may be required, possibly for a prolonged period of time to 
effectively immunize an individual. In the case of chronic infection, administration should 
continue until at least clinical symptoms or laboratory tests indicate that the viral infection 
has been eliminated or substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic or prophylactic treatment 
are intended for parenteral, topical, oral or local administration. Typically, the 
pharmaceutical compositions are administered parenterally, e.g., intravenously, 
subcutaneously, intradermally, or intramuscularly. Because of the ease of administration, 
the vaccine compositions of the invention are particularly suitable for oral administration. 
Thus, the invention provides compositions for parenteral administration which comprise a 
solution of the peptides or conjugates dissolved or suspended in an acceptable carrier, 
preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, 
buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or 
may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or 
lyophilized, the lyophilized preparation being combined with a sterile solution prior to 
administration. The compositions may contain pharmaceutical^ acceptable auxiliary 
substances as required to approximate physiological conditions, such as pH adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, 
sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of DR and/or CTL stimulatory peptides of the invention 

5 in the pharmaceutical formulations can vary widely, i.e. , from less than about 0.1%, 

usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc., in accordance with the particular 
mode of administration selected. 

The peptides and conjugates of the invention may also be administered via 

10 liposomes, which serve to target the conjugates to a particular tissue, such as lymphoid 
tissue, or targeted selectively to infected cells, as well as increase the half-life of the 
peptide composition. Liposomes include emulsions, foams, micelles, insoluble 
monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the like. In 
these preparations the peptide to be delivered is incorporated as part of a liposome, alone 

15 or in conjunction with a molecule which binds to, e.g. , a receptor prevalent among 

lymphoid cells, such as monoclonal antibodies which bind to the CD45 antigen, or with 
other therapeutic or immunogenic compositions. Thus, liposomes filled with a desired 
peptide or conjugate of the invention can be directed to the site of lymphoid cells, where 
the liposomes then deliver the selected therapeutic/immunogenic peptide compositions. 

20 Liposomes for use in the invention are formed from standard vesicle-forming lipids, which 
generally include neutral and negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome 
size, acid lability and stability of the liposomes in the blood stream. A variety of methods 
are available for preparing liposomes, as described in, e.g., Szoka, et ah, Ann. Rev. 

25 Biophys. Bioeng. 9, 467 (1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 
5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be incorporated into the 
liposome can include, e.g., antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a 

30 peptide or conjugate may be administered intravenously, locally, topically, etc. in a dose 
which varies according to, inter alia, the manner of administration, the conjugate being 
delivered, and the stage of the disease being treated. 
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Alternatively, DNA or RNA encoding one or more DR peptides and a 
polypeptide containing one or more CTL epitopes or antibody inducing epitopes may be 
introduced into patients to obtain an immune response to the polypeptides which the 
nucleic acid encodes. Wolff, eU al., Science 247: 1465-1468 (1990) describes the use of 
5 nucleic acids to produce expression of the genes which the nucleic acids encode. Such use 
is also disclosed in U.S. Patent Nos. 5,580,859 and 5,589,466. The nucleic acids can also 
be administered using ballistic delivery as described, for instance, in U.S. Patent No. 
5,204,253. Particles comprised solely of DNA can be administered. Alternatively, DNA 
can be adhered to particles, such as gold particles. The nucleci acids can also be delivered 

10 complexed to cationic compounds, such as cationic lipids. Lipid-mediated gene delivery 
methods are described, for instance, in WO 96/18372; WO 93/24640; Mannino and 
Gould-Fogerite (1988) BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner et al. (1987) Proc. Natl. Acad. Sci. USA 84: 7413-7414. The 
peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia 

15 or fowlpox. This approach involves the use of vaccinia virus as a vector to express 

nucleotide sequences that encode the peptides of the invention. Upon introduction into an 
acutely or chronically infected host or into a noninfected host, the recombinant vaccinia 
virus expresses the immunogenic peptide, and thereby elicits a host CTL response. 
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. 

20 Patent No. 4,722,848, incorporated herein by reference. Another vector is BCG (Bacille 
Calmette Guerin). BCG vectors are described in Stover et al. (Nature 351:456-460 
(1991)) which is incorporated herein by reference. A wide variety of other vectors useful 
for therapeutic administration or immunization of the peptides of the invention, e.g., 
Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the 

25 description herein. 

A preferred means of administering nucleic acids encoding the peptides of 
the invention uses minigene constructs encoding multiple peptides of the invention along 
with CTL inducing peptides. To create a DNA sequence encoding the selected DR 
peptides and CTL epitopes for expression in human cells, the amino acid sequences of the 

30 epitopes are reverse translated. A human codon usage table is used to guide the codon 
choice for each amino acid. These epitope-encoding DNA sequences are directly 
adjoined, creating a continuous polypeptide sequence. To optimize expression and/or 
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immunogenicity, additional elements can be incorporated into the minigene design. 
Examples of amino acid sequence that could be reverse translated and included in the 
minigene sequence include: DR peptides of the invention, a leader (signal) sequence, one 
or more CTL epitope, and an endoplasmic reticulum retention signal. In addition, MHC 
5 presentation of CTL epitopes may be improved by including synthetic (e.g. poly-alanine) 
or naturally-occurring flanking sequences adjacent to the CTL epitopes. 

The minigene sequence is converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides (30- 
100 bases long) are synthesized, phosphorylated, purified and annealed under appropriate 

10 conditions using well known techniques, he ends of the oligonucleotides are joined using 
T4 DNA ligase. This synthetic minigene, encoding the CTL epitope polypeptide, can then 
cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are 
included in the vector to ensure expression in the target cells. Several vector elements are 

15 required: a promoter with a down-stream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) promoter. See, U.S. Patent Nos. 5,580,859 and 5,589,466 for other suitable 

20 promoter sequences. 

Additional vector modifications may be desired to optimize minigene 
expression and immunogenicity. In some cases, introns are required for efficient gene 
expression, and one or more synthetic or naturally-occurring introns could be incorporated 
into the transcribed region of the minigene. The inclusion of mRNA stabilization 

25 sequences can also be considered for increasing minigene expression. It has recently been 
proposed that immunostimulatory sequences (ISSs or CpGs) play a role in the 
immunogenicity of DNA vaccines. These sequences could be included in the vector, 
outside the minigene coding sequence, if found to enhance immunogenicity. 

In some embodiments, a bicistronic expression vector, to allow production 

30 of the minigene-encoded epitopes and a second protein included to enhance or decrease 

immunogenicity can be used. Examples of proteins or polypeptides that could beneficially 
enhance the immune response if co-expressed include cytokines (e.g., IL2,TL12, GM- 



WO 98/32456 ^ PCT/US98/01373 

CSF), cytokine-inducing molecules (e.g. LeIF) or costimulatory molecules. The HTL 
epitopes of the invention could be joined to intracellular targeting signals and expressed 
separately from the CTL epitopes. This would allow direction of the HTL epitopes to a 
cell compartment different than the CTL epitopes. If required, this could facilitate more 
5 efficient entry of HTL epitopes into the MHC class II pathway, thereby improving CTL 
induction. In contrast to CTL induction, specifically decreasing the immune response by 
co-expression of immunosuppressive molecules (e.g. TGF-P) may be beneficial in certain 
diseases. 

Once an expression vector is selected, the minigene is cloned into the 
10 polylinker region downstream of the promoter. This plasmid is transformed into an 
appropriate E. coli strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the minigene, as well as all other elements included in 
the vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial 
cells harboring the correct plasmid can be stored as a master cell bank and a working cell 
15 bank. 

Therapeutic quantities of plasmid DNA are produced by fermentation in E. 
coli, followed by purification. Aliquots from the working cell bank are used to inoculate 
fermentation medium (such as Terrific Broth), and grown to saturation in shaker flasks or 
a bioreactor according to well known techniques. Plasmid DNA can be purified using 

20 standard bioseparation technologies such as solid phase anion-exchange resins supplied by 
Quiagen. If required, supercoiled DNA can be isolated from the open circular and linear 
forms using gel electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of 
formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 

25 phosphate-buffer saline (PBS). A variety of methods have been described, and new 
techniques may become available. As noted above, nucleic acids are conveniently 
formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides 
and compounds referred to collectively as protective, interactive, non-condensing (PINC) 
could also be complexed to purified plasmid DNA to influence variables such as stability, 

30 intramuscular dispersion, or trafficking to specific organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and 
MHC class I presentation of minigene-encoded CTL epitopes. The plasmid DNA is 
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introduced into a mammalian cell line that is suitable as a target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow 
direct in vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be 
5 co-transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). These cells are then chromium-51 labeled and used as target cells for 
epitope-specific CTL lines. Cytolysis, detected by 51Cr release, indicates production of 
MHC presentation of minigene-encoded CTL epitopes. 

In vivo immunogenicity is a second approach for functional testing of 

10 minigene DNA formulations. Transgenic mice expressing appropriate human MHC 

molecules are immunized with the DNA product. The dose and route of administration 
are formulation dependent (e.g. IM for DNA in PBS, IP for lipid-complexed DNA). 
Twenty-one days after immunization, splenocytes are harvested and restimulated for 1 
week in the presence of peptides encoding each epitope being tested. These effector cells 

15 (CTLs) are assayed for cytolysis of peptide-loaded, chromium-51 labeled target cells using 
standard techniques. Lysis of target cells sensitized by MHC loading of peptides 
corresponding to minigene-encoded epitopes demonstrates DNA vaccine function for in 
vivo induction of CTLs. 

For solid compositions, conventional nontoxic solid carriers may be used 

20 which include, for example, pharmaceutical grades of mannitol, lactose, starch, 

magnesium stearate, sodium saccharin, talcum, cellulose, glucose, sucrose, magnesium 
carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic 
composition is formed by incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of active ingredient, that is, one or 

25 more conjugates of the invention, and more preferably at a concentration of 25% -75%. 

For aerosol administration, the peptides are preferably supplied in finely 
divided form along with a surfactant and propellant. Typical percentages of conjugates are 
0.01%-20% by weight, preferably 1%-10%. The surfactant must, of course, be nontoxic, 
and preferably soluble in the propellant. Representative of such agents are the esters or 

30 partial esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, 

octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an 
aliphatic polyhydric alcohol or its cyclic anhydride. Mixed esters, such as mixed or 
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natural glycerides may be employed. The surfactant may constitute 0.1%-20% by weight 
of the composition, preferably 0.25-5%. The balance of the composition is ordinarily 
propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

5 In another aspect the present invention is directed to vaccines which contain 

as an active ingredient an immunogenically effective amount of an immunogenic DR 
peptide or a CTIADR peptide conjugate or nucleic acid encoding them as described herein. 
The conjugate(s) may be introduced into a host, including humans, linked to its own 
carrier or as a homopolymer or heteropolymer of active peptide units. Such a polymer has 

10 the advantage of increased immunological reaction and, where different peptides are used 
to make up the polymer, the additional ability to induce antibodies and/or CTLs that react 
with different antigenic determinants of the virus or tumor cells. Useful carriers are well 
known in the art, and include, e.g., thyroglobulin, albumins such as bovine serum 
albumin, tetanus toxoid, polyamino acids such as poly(lysine: glutamic acid), hepatitis B 

15 virus core protein, hepatitis B virus recombinant vaccine and the like. The vaccines can 
also contain a physiologically tolerable (acceptable) diluent such as water, phosphate 
buffered saline, or saline, and further typically include an adjuvant. Adjuvants such as 
incomplete Freund's adjuvant, aluminum phosphate, aluminum hydroxide, or alum are 
materials well known in the art. And, as mentioned above, CTL responses can be primed 

20 by conjugating peptides of the invention to lipids, such as P 3 CSS. Upon immunization 

with a peptide composition as described herein, via injection, aerosol, oral, transdermal or 
other route, the immune system of the host responds to the vaccine by producing large 
amounts of CTLs specific for the desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic infection. 

25 Vaccine compositions containing the DR peptides of the invention are 

administered to a patient susceptible to or otherwise at risk of disease, such as viral 
infection or cancer to elicit an immune response against the antigen and thus enhance the 
patient's own immune response capabilities, for instance with CTL epitopes described in 
**. Such an amount is defined to be an "immunogenically effective dose." In this use, the 

30 precise amounts again depend on the patient's state of health and weight, the mode of 

administration, the nature of the formulation, etc., but generally range from about 1.0 /ig 
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to about 5000 fig per 70 kilogram patient, more commonly from about 10 fig to about 500 
fig per 70 kg of body weight. 

In some instances it may be desirable to combine the peptide vaccines of the 
invention with vaccines which induce neutralizing antibody responses to the virus of 
5 interest, particularly to viral envelope antigens. For instance, PADRE peptides can be 
combined with hepatitis vaccines to increase potency or broaden population coverage. 
Suitable hepatitis vaccines that can be used in this manner include, Recombivax HB® 
(Merck) and Engerix-B (Smith-Kline). 

For therapeutic or immunization purposes, the peptides of the invention can 

10 also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach 
involves the use of vaccinia virus as a vector to express nucleotide sequences that encode 
the peptides of the invention. Upon introduction into an acutely or chronically infected 
host or into a non-infected host, the recombinant vaccinia virus expresses the immunogenic 
peptide, and thereby elicits a host CTL response. Vaccinia vectors and methods useful in 

15 immunization protocols are described in, e.g., U.S. Patent No. 4,722,848, incorporated 
herein by reference. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are 
described in Stover et al., Nature 351, 456-460 (1991)) which is incorporated herein by 
reference. A wide variety of other vectors useful for therapeutic administration or 
immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, 

20 will be apparent to those skilled in the art from the description herein. 

Antigenic conjugates may be used to elicit CTL ex vivo, as well. The 
resulting CTL can be used to treat chronic infections (viral or bacterial) or tumors in 
patients that do not respond to other conventional forms of therapy, or will not respond to 
a peptide vaccine approach of therapy. Ex vivo CTL responses to a particular pathogen 

25 (infectious agent or tumor antigen) are induced by incubating in tissue culture the patients 
CTL precursor cells (CTLp) together with a source of antigen-presenting cells (APC) and 
the appropriate immunogenic peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated and mature and expand into effector CTL, the 
cells are infused back into the patient, where they will destroy their specific target cell (an 

30 infected cell or a tumor cell). 

The peptides of this invention may also be used to make monoclonal 
antibodies. Such antibodies may be useful as potential diagnostic or therapeutic agents. 
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The peptides may also find use as diagnostic reagents. For example, a 
peptide of the invention may be used to determine the susceptibility of a particular 
individual to a treatment regimen which employs the peptide or related peptides, and thus 
may be helpful in modifying an existing treatment protocol or in determining a prognosis 
5 for an affected individual. In addition, the peptides may also be used to predict which 
individuals will be at substantial risk for developing chronic infection. 

Examples 

Materials and Methods 

Cells. The following Epstein-Barr virus (EBV) transformed homozygous cell lines 

10 were used as sources of human HLA Class II molecules: LG2 [DRBlcOlOl (DR1)1; 
GM3107 [DRB50101 (DR2w2a)]; MAT (DRB10301 (DR3)1; PREISS [DRB10401 
(DR4w4)l; BIN40 [DRB10404 (DR4wl4)l; SWEIG [DRB11101 (DRSwll)]; PITOUT 
[DRB10701 (DR7)] (a); KT3 [DRB10405 (DR4wl5)]; Herluf [DRB11201 (DR5wl2)]; 
HO301 [DRB11302 (DR6wl9)]; OLL [DRB10802 (DR8w2)]; and HTC9074 [DRB10901 

15 (DR9), supplied as a kind gift by Dr. Paul Harris, Columbia University], In some 

instances, transfected fibroblasts were used: L466.1 [DRB11501 (DR2w2b)]; TR81.19 
[DRB30101 (DR52a)]; and L257.6 [DRB40101 (DRw53)]. (Valli, etal J. Clin. Invest. 
91:616 (1993). Cells were maintained in vitro by culture in RPMI 1640 medium 
supplemented with 2mM L-glutamine [GIBCO, Grand Island, NY], 50/xM 2-ME, and 

20 10% heat-inactivated FCS [Irvine Scientific, Santa Ana, CA]. Cells were also 
supplemented with 100 /ig/ml of streptomycin and lOOU/ml of penicillin [Irvine 
Scientific]. Large quantities of cells were grown in spinner cultures. 

Cells were lysed at a concentration of 10 8 cells/ml in PBS containing 1 % 
NP-40 [Fluka Biochemika, Buchs, Switzerland], ImM PMSF [CalBioChem, La Jolla, 

25 CA], 5mM Na-orthovanadate, and 25mM iodoacetamide [Sigma Chemical, St. Louis, 

Mo], The lysates were cleared of debris and nuclei by centrifugation at 10,000 x g for 20 
min. 

Affinity purification of HLA-DR molecules. Class II molecules were purified by 
30 affinity chromatography as previously described (Sette, et al J. Immunol. 142:35 (1989) 
and Gorga, et al 7. Biol Chem. 262:16087 (1987)) using the mAb LB3.1 coupled to 
Sepharose 4B beads. Lysates were filtered through 0.8 and 0.4 fiM filters and then passed 
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over the anti-DR column, which were then washed with 15-column volumes of lOmM 
TRIS in 1% NP-40, PBS and 2-column volumes of PBS containing 0.4% 
n-octylglucoside. Finally, the DR was eluted with 50mM diethylamine in 0.15M NaCl 
containing 0.4% n-octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was 
5 added to the eluate to reduce the pH to -8.0, and then concentrated by centrifugation in 
Centriprep 30 concentrators at 2000 rpm (Amicon, Beverly, MA). 

Class II peptide-binding assays. A panel of 13 different specific DR-peptide assays were 
utilized in the present study. These assays were chosen as to be representative of the most 

10 common t)R alleles. Table I lists for each DR antigen, the representative allelic product 
utilized, the cell line utilized as a source of DR, and the radiolabled probe utilized in the 
assay. Purified human Class II molecules [5 to 500 nM] were incubated with various 
unlabeled peptide inhibitors and 1-10 nM 125 I-radiolabeled probe peptides for 48h in PBS 
containing 5% DMSO in the presence of a protease inhibitor cocktail. The radiolabeled 

15 probes used were HA Y307-319 (DR1), Tetanus Toxoid[TT] 830-843 (DR2w2a, 

DRSwlll, DR7, DR8w2, DR8w3, DR9), MBP Y85-100 (DR2w2b), TT1272-1284 
(DR52a), MT 65 kD Y3-13 with Y7 substituted with F for DR3, a non-natural peptide 
with the sequence YARFQSQTTLKQKT (DR4w4, DR4wl5, DRw53) (Valli, et al 
supra), and for DR5wl2, a naturally processed peptide eluted from the cell line C1R, 

20 EALIHQLINPYVLS (DR5wl2) and 650.22 peptide, (TT 830-843 A - S836 analog), for 
DR6wl9. 

Radiolabeled peptides were iodinated using the chloramine-T method. 
Peptide inhibitors were typically tested at concentrations ranging from 1201 /xg/ml to 1.2 
ng/ml. The data were then plotted and the dose yielding 50% inhibition (IC50) was 

25 measured. In appropriate stoichiometric conditions, the IC50 of an unlabeled test peptide 
to the purified DR is a reasonable approximation of the affinity of interaction (Kd). 
Peptides were tested in two to four completely independent experiments. The final 
concentrations of protease inhibitors were: ImM PMSF, 1.3nM 1.10 phenanthroline, 73 
pM pepstatin A, 8mM EDTA, and 200 jiM N alpha-p-tosyl-L-lysine chloromethyl ketone 

30 (TLCK) [All protease inhibitors from CalBioChem, La Jolla, CA]. Final detergent 

concentration in the incubation mixture was 0.05% Nonidet P-40. Assays were performed 
at pH 7.0 with the exception of DR3, which was performed at pH 4.5, and DRw53, which 
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was performed at pH 5.0. The pH was adjusted as previously described (Sette, et al. J. 
Immunol 148:844 (1992)). 

Class II peptide complexes were separated from free peptide by gel 
filtration on TSK2000 columns (TosoHaas 16215, Montgomery ville, PA), and the fraction 
5 of bound peptide calculated as previously described (Sette, et al. , (1989) supra). In 
preliminary experiments, the DR prep was titered in the presence of fixed amounts of 
radiolabeled peptides to determine the concentration of Class II molecules necessary to 
bind 10-20% of the total radioactivity. All subsequent inhibition and direct binding assays 
were the performed using these Class II concentrations. 

10 

DRB1 specificity of DR4wl5, DR6wl9, DR8w2, DR8w3, and DR9 assays. 

Because the antibody used for purification is a-chain specific, pi molecules 
are not separated from 03 (and/or 04 and p5) molecules. Development and validation of 
assays in regard with DRP chain specificity has been described in detail elsewhere for 

15 many of the DR alleles listed above (108). Herein we describe for the first time DR4wl5, 
DR6wl9, DR8w2, DR8w3, and DR9 assays. Experiments addressing the p chain 
specificity of these new assays are described in the present section. 

DR4wl5. The P4 product DRw53 is co-expressed with DR4wl5 and the 
determination of the specificity of the DR4wl5 binding assay is complicated in that the 

20 same radiolabeled ligand is used for both the DR4wl5 and DRw53 binding assays. Since 
typically pi chains are expressed at 5-10 fold higher levels than other p chains, and all 
binding assays are performed utilizing limiting DR amounts, it would be predicted that the 
dominant specificity detected in the assay would be DR4wl5. To verify that this was 
indeed the case, the binding pattern of a panel of 58 different synthetic peptides in the 

25 putative DR4wl5 specific assay with that obtained in a DRw53 specific assay (which uses 
a DRw53 fibroblast as the source of Class II molecules). Two very distinct binding 
patterns were noted, and in several instances, a peptide bound to one DR molecule with 
high affinity, and did not bind to the other (data not shown). 

DR6wl9. The DR6wl9 assay utilizes as the source of Class II molecules 

30 the EBV transformed homozygous cell line H0301 , which co-expresses DRB30301 

(DR52a). While the radiolabeled ligand used in the DR6wl9 assay is different than that 
used for the DR52a assay, the ligand is related (i.e., is a single substitution analog) to a 
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high affinity DR52a binder. As was done in the case of DR4wl5, the specificity of the 
assay was investigated by analyzing the binding capacity of a panel of naturally occurring 
peptides for DR6wl9 and DR52a. The two assays demonstrated completely different 
binding specificities. For example, in terms of relative binding, TT 1272-1284 binds 
5 63-fold better in the DR52a assay than in the DR6wl9 assay. Conversely, the Invariant 
chain peptide binds 189-fold better in the DR6wl9 assay. In conclusion, these data 
demonstrated that the binding of the radiolabeled peptide 650.22 to purified Class II MHC 
from the H0301 cell line is specific for DR6wl9. 

DR8w2 and DR8w3. The pi specificity of the DR8w2 and DR8w3 assays 
10 is obvious in that no p3 (and/or B4 and p5) molecule is expressed. 

DR9. The specificity of DR9 assay is inferred from previous studies which 
have shown that the TT 830-843 radiolabeled probe peptide does not bind to DRw53 
molecules (Alexander, et al, Immunity 1:751 (1994)). 

15 Results 

DR binding affinity of antigenic peptides recognized by DR restricted T cells 

To define a threshold DR binding affinity, to be considered as biologically 
significant, we compiled the affinities of a panel of 32 reported instances of DR restriction 
of a given T cell epitope. In approximately half of the cases, DR restriction was 

20 associated with affinities of less than 100 nM, and in the other half of the instances, with 
IC50% in the 100-1000 nM range. Only in 1 out of 32 cases (3.1%) DR restriction was 
associated with IC50% of 1000 nM or greater. It was noted that this distribution of 
affinities differs from what was previously reported for HLA class I epitopes, where a vast 
majority of epitopes bound with IC50% of 50 nM or less (Sette, et al , JI, 1994). This 

25 relatively lower affinity of class II restricted epitope interactions might explain why 

activation of class II restricted T cells in general requires more antigen relative to class I 
restricted T cells. 

In conclusion, this analysis suggested that 1000 nM may be defined as an 
affinity threshold associated with immunogenicity in the context of DR molecules, and for 
30 this reason a suitable target for our studies. 



PI and P6 anchors are necessary but not sufficient for DRB10401 binding 
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Several independent studies have pointed to a crucial role in DRB 10401 
binding of a large aromatic or hydrophobic residue in position 1, near the N-terminus of 
the peptide and of a 9-residue core region (residues 1 through 9), In addition, an 
important role has been demonstrated for the residue in position six (P6) of this 9-residues 
5 core region. Short and/or hydrophobic residues were in general preferred in this position 
(O'Sullivan, etaL, JI 147:2663, 1991; Sette, etal y JI 151:3163, 1993; Hammer, etal. 7 
Cell 74:197, 1993 and Marshall, et al. 9 JI 154:5927, 1995). 

In the present set of experiments, a library of 384 peptides was analyzed for 
DRB10401 binding capacity and screened for the presence of the P1-P6 motif (that is, F, 

10 W, Y, L, I, V or M in PI and S, T, C, A, P, V, I, L or M in P6, at least 9 residues apart 
from the peptide C-terminus. This set of 384 peptides contained a total of 80 DR4w4 
binders (specifically 27 good binders [IC50 of 100 nM or less], and 53 intermediate 
binders [IC50 of the 100-1000 range]. Seventy-seven out of the 80 DR4w4 binders (96%) 
carried the P1-P6 motif. However, it should be noted that most non-DR4w4 binding 

15 peptides also contained the P1-P6 motif. Of 384 peptides included in our database, only 
125 were "P1-P6 negative." Only three of them (6%) bound appreciably to purified 
DR4w4 as opposed to 77/259 (30%) of the "P1-P6 positive" peptides. Therefore, these 
results demonstrate that presence of suitable PI and P6 anchors are necessary but not 
sufficient for DRB10401 binding. 

20 

A detailed map of DRB10401 peptide interactions 

Next, for each P1-P6 aligned core region, in analogy with what the strategy 
previously utilized to detail peptide class I interactions the average binding affinity of 
peptides carrying a particular residue, relative to the remainder of the group, were 

25 calculated for each position. Following this method a table of average relative binding 
(ARB) values was compiled. This table also represents a map of the positive or negative 
effect of each of the 20 naturally occurring amino acids on DRB10401 binding capacity 
when occupying a particular position, relative to the main P1-P6 anchors (Figure 1). 

Variations in ARB values greater than four fold (ARB ^ 4 or ^ 0.25) were 

30 arbitrarily considered significant and indicative of secondary effects of a given residue on 
DR-peptide interactions. Most secondary effects were associated with positions 4, 7, and 
9. These positions correspond to secondary anchors engaging shallow pockets on the DR 
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molecule. In addition, significant secondary effects were detected for M in position 3 
(ARB = 12.8) T in position 3 (ARB = 4.34) and I in position 5 (ARB = 4.4). 

Development of a DRB10401 specific algorithm 
5 Next, the ARB table was utilized to develop a DRB 10401 specific 

algorithm. In order to predict 0401 binding propensity, each aligned P1-P6 sequence was 
scored by multiplying, for each position, the ARB value of the appropriate amino acid. 
According to this procedure, a numerical "algorithm score" was derived. If multiple 
P1-P6 alignments were possible, binding scores were calculated for each one and the best 

10 score was selected. The efficacy Of this method in predicting 0401 binding capacity is 
shown in Table Ha. 

Considering only peptides with algorithm scores above -17.00 narrowed the 
set of predicted peptides to 156. This set still contained 72 out of 80 (90%) of the total 
high or intermediate DR binders. Raising the cut-off to an algorithm score of -16.44 or 

15 higher still allowed identification of 60 out of 80 (75%) of the DR4w4 binding peptides. 
Of the whole 107 peptide set, twenty-five of them were either good or intermediate 
binders. In other words, as expected, increasing the algorithm score stringency predicted 
a smaller fraction of the total binders present in the set, but at the same time less false 
positive peptides were identified. 

20 

Blind test of the predictive power of the DRB10401 specific algorithm 

To verify that the predictive capacity of our algorithm was not merely a 
reflection of having utilized the same data set to test and define the algorithm itself, we 
further examined its efficacy in a blind prediction test. For this scope we utilized data 

25 from an independent set of 50 peptides, whose binding affinities were known, but that had 
not been utilized in the derivation of the algorithm. As shown in Table lib, the algorithm 
was effective in predicting DR4w4 binding capacity of this independent peptide set. The 
algorithm score of -17.00 identified a total 18 peptides. This set contained 3/3 (100%) of 
all good binders, and 8/11 (70%) of all intermediate binders in the entire test set of 50 

30 peptides. Increasing the cut-off value to -16.44, identified a set of nine peptides. Seven 
of them (78%) were either good or intermediate binders. This set contained 7 out of 14 
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(50%) of the binders contained in the blind prediction peptide set. In conclusion, these 
data supports the validity of the DR4w4 specific algorithm described above. 

Detailed maps of DRB10401, DRB10101, and DRB10701 peptide binding specificities 
5 Next, we analyzed the binding to purified DR1 and DR7 molecules for the 

same set of 384 peptides utilized to define the DR4w4 algorithm. It was found that this set 
contained 120 and 59 binders for the DR1 and DR7 alleles, respectively. A total of 158 
peptides were capable of binding either DR1 , DR4w4 or DR7. A large fraction of them 
(73/158; 46%) were also degenerate binders, which bound two or more of the three alleles 
10 thus far considered. Furthermore, we also found that more than 90% of the DR1 or DR7 
good and intermediate binders carried the P1-P6 motif. Most importantly, 72 out of 73 
(99%) degenerate DR binders carried this motif (data not shown). In conclusion, this 
analysis suggests that P1-P6 based algorithms might be utilized to effectively predict 
degenerate DR binders. 

15 In analogy with what was described above for DR4w4 molecules, specific 

algorithms were designed for the DR1 and DR7 alleles. Figures 2A and 2B detail the 
allele specific maps defined according to this method. 

As in the case of DRB10401, most secondary effects were concentrated in 
positions 4, 7 and 9. Position 4 was especially prominent in the case of DR1, while 

20 position 7 was the most prominent secondary anchor for DR7. Specific algorithms were 
developed based on these maps, and it was found that the cut-off values necessary to 
predict 75% or 90% of the binders were -19.32 and -20.28 for DR1, and 20.91 and 
-21.63 for DR7, respectively. Depending on the particular allele or cut off value selected, 
40 to 60% of the predicted peptides were in fact good or intermediate binders (data not 

25 shown). 

Development of a DR1-4-7 combined algorithm 

Finally, we examined whether a combined algorithm would allow to predict 
degenerate binders. For this purpose, the sequences of the 384 peptides in our database 
30 were simultaneously screened with the three (DR1, 4w4, and 7) specific algorithms. It 
was found that an even 100 peptides were predicted (using the 75% cut off) to bind either 
two or three of the alleles considered. This set contained 59 out of 73 (81 %) of the 
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peptides which were in fact capable of degenerate 1-4-7 binding (defined as the capacity to 
bind to more than one of the DR1, 4w4 or 7 alleles) (Table EI). 

Definition of a target set of DR specificities, representative of the world population 

5 The data presented in the preceding sections illustrates how peptides capable 

of binding multiple DR alleles can be identified by the use of a combined "1-4-7" - 
algorithm. Next, we wished to examine whether the peptides exhibiting degenerate 1-4-7 
binding behavior would also bind other common DR types as well. As a first step in our 
experimental strategy, we sought to define a set of target DR types representative of a 

10 large (^ 80%) fraction of the world population, irrespective of the ethnic population of 

origin. For this purpose, seven additional DR antigens were considered. For each one of 
the DR antigens considered in this study, (including DR1, 4 and 7), the estimated 
frequency in various ethnicities, according to the most recent HLA workshop (11th, 1991) 
is shown in Table IVa, together with the main subtypes thus far identified. 

15 For the purpose of measuring peptide binding affinity to the various DR 

molecules, one representative subtype for each DR antigen was chosen (Table I). It 
should be noted that for most antigens, either one subtype is by far the most abundant, or 
alternatively a significant degree of similarity in the binding pattern displayed by the 
different, most abundant subtypes of each DR antigen is likely to exist (see comments 

20 column of Table IVb). One exception to this general trend is represented by the DR4 

antigen, for which significant differences in peptide specificity between the 0401 and 0405 
have been reported. Since both alleles are quite frequent (in Caucasians and Orientals, 
respectively) we included both DR 0401 and 0405 in the set of representative DR binding 
assays. 

25 Our set of representative assays is mostly focused on allelic products of the 

gene, because these molecules appear to be the most abundantly expressed, serve as the 
dominant restricting element of most human class III responses analyzed thus far, and 
accurate methods for serologic and DNA typing most readily available. However, we 
have also considered in our analysis assays representative of DRB3/4/5 molecules (Table 

30 IVc). These molecules serve as a functional restriction element, and their peptide binding 
specificity has been previously shown to have certain similarities to the specificity of 
several common DR0, allelic products. 
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A general strategy for prediction of DR-degenerate binders. 

To test whether the 1-4-7 combined algorithm would also predict degenerate 
binding to other common DR types, we measured the capacity of three different groups of 
synthetic peptides to bind the panel of purified HLA DR molecules. The three different 
5 peptide sets were: A) 36 peptides which did not score positive in the combined 1-4-7 
algorithm (non-predictions), B) 36 peptides which did score positive for the 1-4-7 
algorithm, at the 75% cut off level, but had been found upon actual testing not to be 
degenerate 1-4-7 binders ("wrong" predictions), and C) 29 peptides which scored positive 
in the 1-4-7 algorithm, and also proved upon experimental testing, to be actual 1-4-7 
10 degenerate binders (correct predictions). The results of this analysis are shown in Table 
V. 

Within the set of "non-predictions" peptides (Table Va) only 3 out of 34 
(9%) bound at least two of the DR1, 4w4 or 7 molecules. Interestingly, 2 (1136.04 and 
1136.29) out of 3 of these peptides were also rather crossreactive, and bound additional 

15 DR types (DR2w2 p2, DR4wl5, 5wll and 8w2 in the case of 1136.04, and 2w2 p2, 

4wl5, 9 and 5wl2 in the case of 1136.29). Peptides from the "wrong predictions" peptide 
set (Table V5), by definition bound at the most only one of the DR1, 4w4 or DR7 
molecules, and were also poorly degenerate or other DR types with 
only two peptides (1136.22 and 1188.35) binding a total of three DR molecules. Within 

20 this peptide set, no peptide bound four or more of the DR molecules tested (data not 
shown). 

These results are contrasted by data obtained with the peptide set 
corresponding to peptides which were first predicted by the use of the combined 1, 4, 7 
algorithm, and then experimentally found to be degenerate DR1-4-7 binding. Fourteen out 
25 of 29 peptides tested (48%) bound a total of five or more alleles. Four of them were 
remarkably degenerate (1188.16, 1188.32, 1188.34 and F107.09) and bound a total of 
nine out of the 11 DR molecules tested. In conclusion, these results suggest that a strategy 
based on the sequential use of a combined DR1, 4, 7 algorithm and quantitative DR1, 4, 7 
binding assays can be utilized to identify broadly crossreactive DR binding peptides. 



30 
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Definition of the HLA-DR 1-4-7 supertype 

The data presented above also suggested that several common DR types are 
characterized by largely overlapping peptide binding repertoires. When this issue was 
analyzed in more detail, by analyzing the binding pattern of the thirty-two peptides from 
5 Table Va and b which were actual DR1-4-7 degenerate binders. Thirty-one of them (97%) 
bound DR1, 22 (69%) DR4w4 and 21 (66%) DR7. These files are contrasted with the 
low percentages of binding observed amongst the remainder non-degenerate binding 
peptides (17/67 (25%), 8/67 (12%) and 7/67 (10%), for DR1, 4w4 and 7, respectively) 
(Table VII). 

10 Interestingly, a large fraction of the 1-4-7 degenerate binders also bound 

certain other common DR types. Sixteen (50%) bound DR2w2a, 18 (56%) DR6wl9, 18 
(56%) DR2w2b and 20 (62%) DR9. In all cases, the frequency of binding in the non-1-4- 
7 degenerate peptide set was much lower (Table VIII). 

Significant, albeit lower, frequencies of cross reactivity were noted also for 

15 DR4wl5, DR5wll, and DR8w2 (in the 28 to 37% range). Finally, negligible levels of 
cross reactivity were observed in the case of DR3 and 5wl2 and DR53. Further studies 
will address whether either of these two group of molecules (DR4wl5, 5wll, and 8w2 on 
one hand, and DR3, DR53 and 5wl2 on the other) might belong to different DR 
supertypes. 

20 In conclusion, these data demonstrates that a large set of DR molecules 

encompassing DR1, 4w4, 2w2a, 2w2b, 7, 9 and 6wl9 is characterized by largely 
overlapping peptide binding repertoires. 

Discussion 

25 In the present report we have analyzed the peptide binding specificity of a 

set of 13 different DR molecules, representative of DR types common among the 
worldwide population. Detailed maps of secondary anchors and secondary interactions 
have been derived for three of them (DR4w4, DR1 and DR7). Furthermore, we 
demonstrated that a set of at least seven different DR types share overlapping peptide 

30 binding repertoires; and consequently that broadly degenerate HLA DR binding peptides 
are a relatively common occurrence. This study also describes computerized procedures 
which should greatly assist in the task of identification of such degenerate peptides. 
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We would like to discuss the data in the context of our current 
understanding of peptide-class II interactions, as well as in the context of the recently 
described class I supermotifs. Finally, the potential implications of broadly degenerate 
class II epitopes for epitope based vaccine design should also be considered. 
5 Firstly, our studies illustrate how the vast majority of the peptides binding 

with good affinity to DR4w4, DR1, DR7 and most of the other DR types analyzed in the 
current study (data not shown), are all characterized by a P1-P6 motif consistent with the 
one originally proposed by O' Sullivan, et al. Crystallographic analysis of DR1 -peptide 
complexes revealed that the residues occupying these positions engage two complementary 

10 pockets on the DR1 molecule, with the PI position corresponding to the most crucial 
anchor residue and the deepest hydrophobic pocket. Our analysis also illustrates how 
other "secondary anchor" positions drastically influence in an allele-specific manner 
peptide binding capacity. Position 4 was found to be particularly crucial for DR1 binding, 
position 9 for DR4w4, and position 7 for DR7. These data are consistent with previous 

15 results which originally described such allele-specific anchors, and with crystallographic 
data which illustrates how these residues engage shallow pockets on the DR molecule. 

Secondly, our studies illustrate how an approach based on alignment and 
calculation of average relative binding values of large peptide libraries allows definition of 
quantitative algorithms to predict binding capacity. The present study extends those 

20 observations to two other common HLA-DR types, and also illustrates how the combined 
use of the 1-4-7 algorithms can be of aid in identifying broadly degenerate DR binding 
peptides. 

The data presented herein suggest that a group of common DR alleles, 
including at least DR1, DR2w2a, DR2w2b, DR4w4, DR6wl9, DR7 and DR9 share a 

25 largely overlapping peptide repertoire. Degenerate peptide binding to multiple DR alleles, 
and recognition of the same epitope in the context of multiple DR types was originally 
described by Lanzavechia, Sinigallia's and Rothbard's groups. The present study provides 
a classification of alleles belonging to a main HLA-DR supertype (DRl-4-7-like) which 
includes DR1, DR2w2a, DR2w2b, DR4w4, DR7, DR9, DR6wl9. On the basis of the 

30 data presented herein, at least two additional groups of alleles exist. The first group 

encodes for molecules with significant, albeit much reduced overlap with the 1-4-7-like 
supertype (DR4wl5, 8w2, 5wll). The second group of alleles (5wl2, 3wl7, and w53) 
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clearly has little repertoire association with the 1-4-7 supertype. In this context it is 
interesting to note that Hammer, et al noted that good DR5wl 1 binding peptides are 
frequently characterized by positively charged P6 anchor (which would be poorly 
compatible) with the herein proposed 1-4-7 supermotif. It is also interesting to note that 
5 Sidney, et al proposed that DR3wl7 binds a set of peptides largely distinct from those 
bound by other common DR types. Future studies will have to determine whether any of 
the molecules listed above can be grouped in additional DR supertypes. Our group is 
currently investigating whether analysis of polymorphic residues lining the peptide binding 
pockets of DR can be utilized to aid in the classification and prediction of HLA DR 
10 supertypes. 

We would like to comment on similarities and differences between the HLA 
DR supertype described herein and the recently described HLA class I supermotifs. Class 
I supermotifs are clear-cut and, as a rule, non-overlapping. Four of them have been 
described all approximately equally frequent amongst the worldwide population. By 

15 contrast, the repertoire defining the HLA DR supertype herein described is not clear-cut 
and overlaps, at least in part, with the repertoire of other alleles. It also appears that on 
the basis of the data presented in Tables I and IV, even if other DR supertypes exist, the 
DR1-4-7 is going to be by far the most abundantly represented worldwide. 

Finally, we would like to point out the possible relevance of these data in 

20 terms of development of epitope based vaccines. Class II restricted HTL have been 

implicated in protection from, and termination of many important diseases. Inclusion of 
well defined class II epitopes in prophylactic or therapeutic vaccines may allow to focus 
the immune response towards conserved or subdominant epitopes, and avoid suppressive 
determinants. Based on the data presented herein (Table IV), the DR1-4-7 supertype 

25 would allow coverage in the 50 to 80% range, depending on the ethnicities considered. It 
is thus possible that broad and not ethnically biased population coverage could be achieved 
by considering a very limited number of peptide binding specificities. 

Based on the results present above, the sequences of various antigens of 
interest were scanned for the presence of the DR 1-4-7 motifs. Peptides identified using 

30 this approach are broadly cross reactive, class II restricted T cell epitopes. Table VIII 
presents a listing of such peptides derived from HBV, HCV, HIV and Plasmodium 
falciparum (Pf). A total of 146 peptides were identified: 35 from DHBV, 16 from HCV, 
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50 from HIV, and 45 from Pf. Standard conservancy criteria were employed in applicable 
cases. 

The above examples are provided to illustrate the invention but not to limit 
its scope. Other variants of the invention will be readily apparent to one of ordinary skill 
5 in the art. All publications, patents, and patent applications cited herein are hereby 
incorporated by reference for all purposes. 
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Table II 



An algorithm to predict DRB1*0401 binding capacity, 
a) Original peptide set 



No. of peptides (Binding nM) 

Selection High Inter. Non 

Criteria SlOO 100-1000 >1000 Total 

None 27 53 304 384 

P1-P6 27 50 182 259 

-17.00 M 27 45 84 156 

-16.44 B 25 35 47 107 



1) 
2) 



Algorithm score which predicts 90% of all binders. 
Algorithm score which predicts 75% of all binders. 
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Table II 

b) Blind test of the predictive power of the DRB1*0401 algorithm. 

No. of peptides (Binding nM) 



Selection High Inter. Non 

Criteria S100 100-1000 >i000 Total 

None 3 11 36 50 

P1-P6 3 9 28 40 

-17.00 3 8 7 18 

-16.44 3 4 2 9 
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Table III 

A combined "1-4-7" algorithm. 



Selection 
Criteria 



Degenerate 
Binders w 



Percent of Total 
Degenerate Binders 



None 
P1-P6 



Combined Algorithms 
(90% Cutoff Value) 



Combined Algorithms 
(75% Cutoff Value) 



73/384 
72/259 

67/147 
59/100 



100% 
99% 

92% 
81% 



1) Degenerate binders are defined as peptides binding at least two out of the three 
DR1, 4w4, and 7 molecules with an IC50 of 1 or less. 
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Table IV 

Fhenotypic frequencies of 10 prevalent HLA-DR antigens 



Fhenotypic Frequencies 



Antigen 


Alleles 


Cauc. 


Blk. 


Jpn. 


Chn. 


Hisp. 


Avg. 


DR1 


DRBr0101-03 


18.5 


8.4 


10.7 


4.5 


10.1 


10.4 


DR2 


DRBl*1501-03 


19.9 


14.8 


30.9 


22.0 


15.0 


20.5 


DR3 


DRBl*0301-2 


17.7 


19.5 


0.4 


73 


14.4 


11.9 




DRB1 0401-12 


23.6 


6.1 


40.4 


21.9 


29.8 


24.4 


DR7 


DRBl»0701-02 


26.2 


11.1 


1.0 


15.0 


16.6 


14.0 


DR8 


DRBr0801-5 


55 


10.9 


25.0 


10.7 


233 


15.1 


DR9 


DRBl'0901 1,09012 


3.6 


4.7 


245 


19.9 


6.7 


11.9 


DR11 


DRB1M101-05 


17.0 


18.0 


4.9 


19.4 


18.1 


153 


DR12 


DRB1M201-02 


25 


55 


13.1 


17.6 


5.7 


8.9 


DR13 


DRB1M301-06 


21.7 


165 


14.6 


12.2 


10.5 


15.1 


Total 




97.0 


83.9 


98.8 


95.5 


95.6 


94.7 
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Table VII 



Frequency of Binders 



I-4.7 Non 1-4-7 

DRType Degenerate Degenerate 

Binders (%) Binders (%) 



t 31/32 (97) 17/67 (25) 

4w4 22/32 (69) 8/67 (12) 

7 21/32 (66) 7/67 (10) 

9 20/32 (62) 2/67 (3.0) 

6W 19 18/32 (56) 6/67 (8.9) 

2w 2Bb 18/32 (56) 16/67 (24) 

2w 2fia 16/32 (50) 10/67 (15) 

4W 15 12/32 (37) 4/67 (6.0) 

8w2 10/32 (31) 3/67 (4.5) 

5wll 9/32 (28) 6/67 (8.9) 

1^ 3/32 (9.4) 4/67 (6.0) 

3W 17 1/32 (3.1) 0/67 (0) 

w53 2/16 (13) 7/43 (16) 
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Table Vin 



Sequence 


Source 


lit 
Po. 


Conservancy 


Predicted 
1-4-7 




Pf TRAP 


227 


100 


3 


11-0 vrrLALr rllr N 


Pf EXP1 


3 




3 


a\z>jv I JM~A IbV LACiLL 


Pf EXP1 


71 




3 


MKLAIbVLAGLLGN 


Pf EXP1 


73 




3 




Pf TRAP 


4 


100 


3 


i-d vrrJLALJ*rllrNK 


Pf EXP1 


4 




3 


L> V IN i^JJLrHiNCjKIIK 


Pf LSA1 


13 




3 




Pf EXP1 


1 




3 


MRKT ATT QVccrrr n/ 


Pf CSP 


2 


95 


3 




Pf CSP 


417 


95 


3 


NVKYI VTVFT TPcnr — — 


Pf ikap 


6 


100 


3 


SFYFTLVNI T TPT-TTM ~" 


Pf LSA1 


8 




3 


VFFL ALFT^ITFTJV pq 


Pf EXP1 


6 




3 


YFILVNLI.TFH1 Mrvr 


Pf LSA1 


10 




3 


YISFYFILVNLT TTO 


Pf LSA1 


6 




3 


ACLLCN VSTVT 1 nFv 


Pf EXP1 


82 




2 


ANOL W TLTDGrPDS 


I'l i KAP 


153 


100 


2 


AYKFWPGAATPYAf 


I't 1KAP 


514 


80 


2 


DKELTMSN VKNVSOT 


Pf T e a 1 
I I L5A1 


81 




2 


FN WNSSI GU MVLS 


If Uir 


413 


100 


2 


FYFILVNLUFHING 


I I LbAl 


9 




2 


GLAYKFWPCAATPY 


1 i IKAP 


512 


80 


2 


GRDVONNrVDFTKYR 


I s TRAP 


25 


90 


2 


HILYISFYFILVMLL 


Pf I C A 1 

1 1 LbAl 


3 




2 


HNWVN H AVPLAMKLI 


Pf TO AP 

I 1 1KAP 


"62 


80 


2 


IVFUFFDLFLVNGR 


Pf "TO a r» 

If InAP 


12 


100 


2 


KFWPCAATPYACEP 


Pf tl> a r> 
III KAI 


516 


80 


2 


KSLLRNLGVSENIFL 


Pf I CA 1 

1 1 LbAl 


98 




2 


KYLVTVFLIFFDLFL 


Pf TP AP 


8 


100 


2 


LAGLLGNVSTVLLGC 


Pf PVP1 
1 I CA1 I 


81 




2 


LGNVSIVLLGGVGLV 


Pf EXP1 


85 




2 


UFFDLFLV NCRDVQ 


Pf TRAP 


15 


100 


2 


LWILTDGIPDSIQD 


Pf TRAP 


156 


100 


2 


QLWILTDGIPDSIQ 


Pf TRAP 


155 


100 


2 


RGYY1PHQ55LPQDN 


Pf LSA1 


1669 




2 


RHNWVNHAVPLAMKL 


Pf TRAP 


61 


80 


2 


RHPFK1GS5DPADNA 


Pf EXP1 


107 




2 


SSVFNWNSSIGUM 


Pf CSP 


410 


95 


2 


VFNWNSSIGUMVL 


Pf CSP 


412 


95 


2 


VKNVIGPFMKAVCVE 


Pf TRAP 


223 


100 


2 


VKYLVIVFLIFFDLF 


Pf TRAP 


7 


100 


2 


V5TVLLCGVGLVLYN 


Pf EXP1 


88 




2 


WENVKNVIGPFMKAV 


Pf TRAP 


220 


100 


2 


YKFWPGAATPYACE 


Pf TRAP 


515 


80 


2 
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Table Vm 



PCT/US98/01373 



Sequence S 


ource 


1 at 


Conservancy 


r red let eo 
M-7 


ENRWQVMIVWQVDRM I 


flVl VIF 


2 


81 


3 


ERYUCDQQLLCIWCCS I 


nvi ENV 


589 




3 


ESELVSQHEQLDCK I 


4IV1 POL 


696 


80 


3 


FRKYTAFnPSINNE I 


flVl POL 


303 


93 


3 


GQMVHQAISPKTLNA 1 


*IV1 GAG 


172 


88 


3 


IPEWEFVNTPPLVKL I 


m\ POL 


593 


93 


3 


LPPWAKEIVASCDK I 


POL 


770 


87 


3 


NREILKEPVHGVYYD I 


HV1 POL 


485 


87 


3 


PAIFQSSMTKILEPF 1 


-nvi pol 


336 


80 


3 


PPWAKEIVASCDKC 1 


-tWl POL 


771 


87 


3 


QEQIGWMTN NPPIPV 1 


rHVl GAG 


276 


81 


3 


QGQMVHQAISPRTLN 1 


hHVl GAG 


171 


85 


3 


SPAIFQSSMTKILEP 


KT/1 POL 


335 


80 


3 


TLNFPISPIETVPVK 


HIV1 POL 


176 


100 


3 


VK NW MTETLLVQN AN 


HIV1 GAG 


348 


81 


3 


VPVWKEATTTLFCAS 


HIV1 ENV 


54 


81 


3 


WEFVNTPPLVKLWYQ 


HIV1 POL 


5% 


93 


3 


WVKVVEEKAF5PEVI 


HIV GAG 


187 


33 


3 


YYCVPVWKEATTTLF 


HIV1 ENV 


51 


83 


3 


ASDFNLPPWAKHV 


MVl POL 


765 


80 


2 


ASGYIEAEVIPAETG 


HIV1 POL 


822 


93 


2 


DFNLPPVVAKHVAS 


HIV1 POL 


767 


87 


2 


EAURILQQLLFIHF 


HIV1 VPR 


58 


82 


2 


EKVYLAWVPAHKCIG 


HIV1 POL 


711 


93 


2 


ETAYFLLKLAGRWPV 


HIV POL 


838 


65 


2 


EVQLC1PHPACLKKK 


HtVt POL 


268 


80 


2 


FWEVQLCIPHPACLK 


HIV1 POL 


266 


100 


2 


GCTLNFPISPTETVP 


HIV1 POL 


174 


100 


2 


CHYKRWULCLNKI 


HIV1 GAG 


294 


85 


2 


CTVLVCFTPVNIIGR 


HIV1 POL 


153 


100 


2 


HKAICTVLVGPTPVN 


HTV1 POL 


149 


93 


2 


ICTVLVGTTPVNnG 


HIV POL 


152 


74 


2 


KRWIILGLNK3VRMY 


HTV1 GAG 


298 


88 


2 


KV YLAWVP AHKG1GG 


HIV POL 


712 


74 


2 


UCITAVPWNASWSNK 


HIV1 ENV . 


607 




2 


LLQLTVWGIKQLQAR 


HIV1 ENV 


731 


80 


2 


NFPISPIETVPVKLK 


HTV1 POL 


178 


100 


2 


PQGW KG5P A1FQ5SM 


HIV1 POL 


329 


87 


2 


PVNnGRNLLTQICC 


HTV1 POL 


161 


87 


2 


QHLLQLTVWGIKQLQ 


HIV1 ENV 


729 


80 


2 


QQHLLQLTVWCIKQL 


HTV1 ENV 


728 


80 


2 


SPEVIPMFSALSEGA 


HTV1 GAG 


197 


88 


2 


TKELQKQITKIQNFR 


HTV POL 


952 


67 


2 


TVLVCPTPVNIICRN 


HIV1 POL 


154 


100 


2 


VEAIIRILQQLLFIH 


HIV1 VPR 


57 


82 


2 


VIPMF5ALSEGATPQ 


HTV1 GAG 


200 


88 


2 


VNHGRNLLTQIGCT 


HTV1 POL 


162 


87 


2 


WCCSCKUCITAVPWN 


HIV1 ENV 


601 




2 


WnLCLNKIVRMYSP 


HIV1 GAG 


300 


88 


2 


YKRWIILGLNKIVRM 


HIV1 GAG 


297 


88 


2 


FILVNLUFHINGKI 


Pf LSA1 


11 




3 
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Table VIII 



PCT/US98/01373 



Sequence 


source 


1st 
Fos 


Conservancy 


Predicted 
1-4-7 


A&1JLN LlsfM LN VSIP W 


HBV POL 


38 


95 


3 


ULJNL,L»NLJN V51i'W Iti 


HBV POL 


40 


95 


3 


orrlXlivlLTlPQSL 


HBV ENV 


181 


80 


3 


Ir Lr I LL L^LLr LLV 


HBV ENV 


245 


80 


3 


IN UN V btl W I HKVGNF 


HBV POL. 


45 


95 


3 


m-LAk^r i^ah-sw 


HBV POL 


523 


95 


3 


VY L^LL VPfr VQWF 


HBV ENV 


332 


100 


3 


CTJCT T A OPTt A T r 

orrLLAQFTSAICSV 


HBV POL 


522 


95 


3 


D V W L3L.L. V I'P V *J 


HBV ENV 


330 


80 


3 


Ar o I M UU V VLGAKSV 


HBV POL 


546 


90 


2 




HBV ENV 


180 


80 


2 


rvywrvotsnvvYLb 


HBV ENV 


342 


95 


2 


l*/\nJU5LJ£V«Ll' V ^Afrb 


HBV X 


50 


90 


2 


vj 1 31* V I Vl^ALNl AU 


HBV POL 


774 


80 


2 


OVWiRl 11 A x Kl 1 N A 


HBV NUC 


123 


95 


2 


HLbLKl*Lr V wVrsb A 


HBV X 


52 


90 


2 


1 IrLrl LLLwLLrLL 


HBV ENV 


244 


80 


2 


I Li- LV-Llr LL V LLU I 


HBV ENV 


249 


95 


2 


1 V Lj HAjj* A Al Jp 1 ^VAj 


HBV POL 


636 


90 


2 


JSXAV1 [N 1*1^3 LI NLLo 


HBV POL 


406 


95 


2 


Lt/WJr I Or\ IVwD V V JM\>\ 


HBV POL 


526 


95 


2 




HBV ENV 


252 


95 


2 


LCOVFA D ATPTCVJCZl 


MbV I*UL 


694 


95 


2 


LHLYSHP1TI rrpRJfl 


riDV POL 


501 


80 


2 


l,in TFT T VT T DYOn 


um; ckt\/ 
MbV ENV 


251 


95 


2 


LVLLDYOCMLPVCPL 


tlOV fclNV 


ICO 


on 


2 


LVPFVOWFVCI <;PTV 


rioV bNV 




oc 


2 


PLPIHT AELLA ACTA 


unu tvit 
rtov i\Ju 


722 


on 


2 


OCGYPAl MPT YAfTO 


trot/ new 
rloV rvJL 


648 


95 


2 


RDLLDTASAI YRFAT 


um/ KIT V* 

ribV NUC 


28 


80 


2 


SFC VWIRTPP A YR PP 


ruJV NUL 


121 


90 


2 




HBV rUL 


750 


85 


2 


VGI T PPA APFTfWY 


ribV POL 


637 


95 


2 


VPN I CKt TMT I QCKIT 


HDV POL 


409 


85 


2 




tim; new 
HdV POL 


404 


95 


2 


KVLVLNPSVAATT (TP 


MCV 


1255 


100 


3 


rTLWARMILMTHFFS 


n\.v 


■>R7n 




3 


ADLMGYIPLVGAPLC 


14CV 


Ml 


/y 


2 


AVOW MNRLI AFASRG 


n\.V 


1017 


inn 

lull 


2 


DLEUT5CSSNVSVA 


riv— v 




O'X 


2 


DLYLVTRHADVIPVR 


HCV 


1134 


79 


2 


EDlVNlLPAILSPCA 


HCV 


1882 


79 


2 


nTLPALSTCUHLH 


HCV 


684 


79 


2 


GARLWLATATPPGS 


HCV 


1345 


79 


2 


CIQYLACL5TLPGNP 


HCV 


1776 


100 


2 


GVNYATGNLPGCSFS 


HCV 


161 


79 


2 


IQYLAGLSTLPGNPA 


HCV 


1777 


100 


2 


LH GLS AF5LHSYSPG 


HCV 


2919 


79 


2 


VNLLPAJXSPGALW 


HCV 


1885 


79 


2 


VQW MNRLI AFASRCN 


HCV 


1918 


100 


2 


YKVLVLNPSVAATLG 


HCV 


1254 


100 


2 
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Peptide 



A A 



40 



Sequence 



Class II Peptides 

Source 



008.00 


1 6 


SALLSSDFTASVNCAK 


HEL 81*06 


200.06 


1 6 


SALSEGATPQDLNTML 


HIV gp25 41-56 


213.10 


16 


NKALELFRKDIAAKYK 


Sp. W. myo. 132-147 


506.01 


ZO 


NKALELFRKDIAAKYKELGY 


SW Myo 132-151 


506.03 


18 


ALELFRK01AAKYKELGY 


Sp. W myo. 134-151 


506.05 


1 6 


ELFRKDIAAKYKELGY 


Sp. W myo. 136-151 


570.01 


1 6 


MAKUAYDEEARFK3LE 


Haat Shock Prot 


705.06 


20 


KVYLPRMKMEEKYNLTSVLM 


Ova 279-298 


717.04 


14 


YASFVKTTTLRKFT.NH2 


combinatorial; DR2 optimized 


857.02 


20 


PHHTALRQAtLCWGELMTLA 


HBV core 50-69 


865.01 


1 S 


YKMKMVHAAHAKMKM 


OVA KM core extension 


F050.03 


20 


GFYTTGAVRQIFGDYKTTIC 


PLP 91-110 


F089.01 


15 


QNILLSN APLGPQFP 


Tyrosinase 66*70 


F098.03 


20 


AAYAAQGYKVLVLNPSVAAT 


HCV NS3 1242-1261 


F098.04 


20 


QYKVLVLNPSVAATUGFGAY 


HCV NS3 1248-1267 


F098.05 


1 4 


GYKVLVLNPSVAAT 


HCV NS3 1248-1261 


F09B.06 


19 


SYVNTNMGLKFRQLLWPHI 


HBV Cora 87-105 


F098.10 


1 2 


GLKFROLLWFHi 


HBV Core 94-105 


F134.04 


20 


TLHGPTPLLYRLGAVQNST 


HCV NS4 1-20 


F1 34.05 


20 


NFISGIQYLAGLSTLPGNPA 


HCV NS4 151-170 


F1 34.08 


21 


GEGAVQWMNRUAFASHGNHV . 


HCV NS4 293-313 (1914-1934) 


IA-p5 


1 7 


KPVSOMRMATPLLMRPM 


Mouse invariant chain 85-101 


Tr-28 p1 


24 


LPKPPKPVSKMRMATPLLMOAIFM 


Human invariant chain 80-103 


27.0279 


15 


EYLVSFGVWIRTPPA 


HBV NUC 117 


27.0280 


15 


GVWIRTPPAYRPPNA 


HBV NUC 123 


27.0281 


1 5 


RHYLHTLWKAGILYK 


HBV POL 145 


27.0283 


1 5 


VPNLQSLTNLLSSNL 


HBV POL 409 


27.0288 


1 5 


WVTVYYGVFVWKEAT 


H1V1 ENV 47 


27.0293 


1 5 


YYGVPVWKEATTTLF 


HIVl ENV 51 


27.0294 


1 5 


VPVWKEATTTLFCAS 


HIV1 ENV 54 


27.0295 


1 5 


LSGIVQQQNNLLRAI 


HIVl ENV 711 


27.0296 


IS 


OQHLLQLTVWSIKQL 


HIV1 £NV 728 


27.0297 


1 5 


QKLLOLTVWG1KQLQ 


H1V1 ENV 729 


27.0298 


1 5 


LLQLTVWGiKQLQAR 


H1V1 ENV 731 


27.0304 


15 


CX3QMVHQAISPRTLN 


HIV1 GAG 171 


27.0307 


15 


SPEVIPMFSALSEGA 


HIV1 GAG 197 


27.0310 


15 


QEQIGWMTNNPP1PV 


H1V1 GAG 276 


27.0311 


15 


GEIYKRW11LGLNK1 


HIV1 GAG 294 


27.0312 


1 5 


YKRWl lUGLNKIVRM 


HIVl GAG 297 


27.0313 


1 5 


KRWltLGLNKIVRMY 


HIV1 GAG 298 


27.0314 


1 5 


WULGLNKIVRMYSP 


HIVl GAG 300 


27.031 S 


1 5 


VKNVVMTETULVaNAN 


HIV1 GAG 348 


27.0322 


1 5 


GTVLVGPTrVNi K3H 


uiv/1 pni in 

niVl rUL 00 


27.0324 


1 5 


PVNIIGRNULTQIGC 


HIVl POL 161 


27.0326 


15 


GRI^LLTOIGCTINFP 


HIVl POL 166 


27.0328 


1 5 


TLNFPISPIETVPVK 


HIVl POL 176 


27.0329 


1 5 


NFPISPIETVPVKLK 


HIV1 POL 178 


27.0341 


1 5 


FRKYTAFT1PSINNE 


HIV1 POL 303 


27.0344 


1S 


SPAIFQS5MTKILEP 


HIV1 POL 335 


27.0345 


16 


PAIFQSSMTKILEPF 


HIVl POL 336 


27.0349 


15 


QKLVGKLNWASQIYA 


HIV1 POL 437 


27.0350 


1 5 


VGKLNWASQIYAGIK 


HIVl POL 440 


27.0351 


15 


NREILKEPVHGVYYD 


H1V1 POL 485 


27.0353 


15 


IPEWHFVMTPPLVKL 


HIV1 POL 593 


27.0354 


1 5 


WEFVNTPPLVKLWYQ 


HIVl POL 596 


27.0360 


15 


EQUKKEKVYLAWVP 


HIV1 POL 705 
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Class 11 Peptides 



Peptide 


AA 


Sequence 


Source 


27.0361 


15 


EKVYLAWVPAHKGIQ 


HIV1 POL 711 


27.0364 


15 


HSNWRAMASDFNIPP 


HIV1 POL 758 


27.0370 


15 


ASGYIEAEVJPAETG 


HIV1 POL 822 


27.0372 


15 


AEHLKTAVOMAVFiH 


HIV1 POL 911 


27.0373 


15 


KTAVQMAVRHNFKR 


HIVl POL 91 S 


27.0377 


15 


QKQTTX1QNFRVYYR 


HIV1 POL 956 


27,0379 


15 


KLLWKGEQAW10DN 


HIVl POL 982 


27.0381 


1S 


ENRVVOVMIWOVDRM 


HIV1 V1F 2 


27.0382 


15 


VEAJIRILQQLLF1H 


H1V1 VPR 57 


27.0384 


15 


FNWNSSIGUMVLS 


Pf CSP 413 


27.0387 


16 


MNYYGKQENWYSLKK 


Pf CSP 53 


27 .0388 


15 


MRKLAILSVSSFLFV 


pr CSP 2 


27.0390 


15 


NSSIGUMVLSRJL 


PI CSP 417 


27.0392 


15 


SSVFNWNSSIGUM 


pr csp 410 


27.0393 


15 


MKILSVFFLALFFII 


Pf EXP1 1 


27.0398 


1S 


F1LVNLUFHINGKI 


Pt. LSA1 11 


27.0400 


1 5 


HILYISFYFILVNLL 


Pf LSA1 3 


27.0402 


15 


LUFHINGKIIKNSE 


Pf LSA1 16 


27.0403 


15 


LVNLL1FHINGK1IK 


Pf LSA1 13 


27.0406 


15 


NLUFHINGKIIKNS 


Pf LSA1 15 


27.040B 


1 5 


GfTNFKSLLRNLGVSE 


Pf LSA1 94 


27.0412 


1 5 


AYKFWPGAATPYAG 


Pf SSP2 514 


27.0415 


1 5 


NVKYLVJVFUFFDL 


Pf SSP2 6 


27.0417 


15 


VKNV1GPFMKAVCVE 


Pf SSP2 223 


27.0418 


15 


WENVKNVIGPFMKAV 


Pf SSP2 220 


1186.04 


1 5 


CSWRRAFPHCLAFS 


HBV POL 534 


1186.06 


15 


FVQWFVGLSPTVWLS 


HBV ENV 342 


1186.10 


15 


LAQFTSAICSWRRA 


HBV POL 526 


1186.1 S 


15 


LVPFVQWFVGLSPTV 


HBV ENV 339 


1186.18 


16 


NLSWLSLDVSAAFYH 


HBV POL 422 


1186.25 


15 


SFGVWIRTPPAYRPP 


HBV NUC 121 


1186.26 


15 


SPFLLAQFTSAICSV 


HBV POL 522 


1186.27 


15 


SSNLSWLSLDVSAAF 


HBV POL 420 


1188.01 


15 


DKELTMSNVWWSQT 


PI LSA1 81 


1188.13 


1 5 


AGLLGNVSTVLLGGV 


Pf EXP1 82 


1188.16 


1 6 


KSKYKUATSVLAGLL 


Pf EXP1 71 


1188.32 


1 5 


GLAYKFVVPGAATPY 


Pf SSP2 512 


1188.34 


1 5 


HMWvNHAVPLAMKU 


PI 55P2 62 


1188.35 


IS 


IGPFMKAVCVEVEKT 


Pf SSP2 227 


1188.38 


1 5 


KYK1AGGIAGGLALI 


Pf SSP2 494 


1188.45 


1 5 


RHNWVNHAVPLAMKL 


Pf SSP2 61 


F091.1S 


16 


IKORNMWQEVGKAMY 


HfVl ENV 566 


F 107 .03 


1 6 


LQSLTNLLSSNLSWL 


HBV POL 412 


F1 07.04 


1 5 


PFLLAOFTSAlCSW 


HBV POL 523 


F1 07.09 


15 


KYKUATSVLAGLLGN 


Pf EXPl 73 


F 107.10 


1 5 




Pf EXP1 81 


F107.11 


15 


RHPFKK2SSDPADNA 


Pf EXP1 107 


F107.14 


1 5 


ANQLW1LTDGIPDS 


Pt SSP2 153 


F107.17 


1 5 


KFWPGAATPYAGEP 


Pf SSP2 516 


F1 07.23 


15 


VFNVVNSS1GLIMVL 


Pf CSP 412 


35.0093 


15 


VGPLTVNEKRRLKU 


HBV POL 96 


35.0096 


15 


ESRLWDFSQF5RGN 


HBV POL 387 


35.0100 


15 


LCQVFADATPTGWGL 


HBV POL 683 


35.0106 


1 5 


VVWATDALMTGYTG 


HCV 1437 


3S.0107 


1 5 


TVDFSLDPTTnETT 


HCV 1466 


35.0125 


15 


AETFYVDGAANRETK 


HIV POL 619 



2 
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Peptide AA Sequence 
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Class 11 Peptides 

Source 



35.0127 

35.0131 

35.0133 

35.0135 

35.0171 

35.0172 

1280.02 

1280.03 

1280.04 

1280.06 

1280.08 

1280.09 

1280.12 

1280.13 

1280.15 

1280.16 

1280.21 

1280.22 

1280.23 

1280.25 

1283.02 

1283.10 

1283.11 

1283.12 

1283.13 

1283.14 

1283.16 

1283.17 

1283.20 

1283.21 

1283.22 

1283.24 

12B3.26 

1283.30 

1283.31 

1283.33 

1283.34 

1283.36 

12B3.37 

1283.44 

1283.50 

12B3.S5 

1283.57 

1283.61 

1298.02 

1298.03 

129B.04 

1298.06 

1298.07 

1298.08 

1298.10 

1298.11 

1298.13 

1298.16 

F1 25.02 

F12S.04 



1 5 EVNIVTOSQYAU3U 

1 5 WAG1KQEFG1PYNPQ 

1 5 GA WJQDNSD) KWP 

1 5 YRKl LRORKIDRUD 

1 5 POSQDSLKESRKLN 

1 5 KCNLYADSAWENVKN 

1 5 IGTVLVGPTPVNHG 

1 6 KVYLAWVPAHKGIGG 
1 5 TKELQKOrrWQNFR 

1 5 AGFFLLTR1LT1POS 

1 5 GFFULTR1 LTIPQSL 

1 5 GTSFVYVPSALNPAD 

1 5 HFLFILLLCUFLL 

1 5 KFAVPNLQSLTNLLS 

1 5 LHLYSHPULGFRKl 

1 5 UCUFLLVUI3YQG 

1 5 VGLLGFAAPFTQCGY 

1 5 FYFILVNUUFHING 

1 5 KSLJLRNLGVSENtFL 

1 5 RGYYIPHQSSLPQDN 

1 S VYLLPRRGPRLGVRA 

1 5 GHRMAWDMMMNWSPT 

1 5 CGFVTCFTPSPWVG 

1 5 VYCFTPSPWVGTTD 

15 GNWFGCTWMNSTGFT 

1 5 FTTLPAL5TGUHLH 

1 5 5KGWBLLAPITAYAQ 

1 5 DLYLVTRHADV1PVR 

1 5 AQGYKVLVLNPSVAA 
1 5 GYKVLVLNPSVAATL 
1 5 VLVLN PSVA ATLGFG 

1 5 GARLWLATATPPGS 
1 5 DVWVATDA1MTGYT 
1 S FTGLTHIDAHFISQT 
1 5 YLVAYQATVCARAQA 
1 5 L£WTSTWVLVGGVL 
1 S TVWLVGGVLAALAAY 
1 5 AKH MWNRSGIQ YlA 

1 5 IOYLAGLSTLPGNPA 
1 5 MNRUAFASRGNHVS 
1 5 SYTWTGALfTPCAAE 
1 S GSSYGFOTSPGCRVE 
1 5 LEUTSCSSNV5VAH 
1 5 ASCLRKLGVPPIRVW 
1 5 VGNFTGLYSSTVPVF 
1 5 TNFHSU3IHLNPNK 
1 5 KQCFRKLPVNRPIDW 
1 S KQAFTFSPTYKAFUC 
1 5 AANWILRGTSFVYVP 
1 S PDRVHFASPLHVAWR 
1 5 IRPWSTQOLNGSL 
1 5 RSELYKYKWKIEPL 
1 5 DRFYKRRAEQASOE 

1 5 KVILVAVHVASGYIE 
1 7 LVNLUFHINGKIIKNS 

1 6 RHNWVNHAVPLAMKU 



HIV POL 674 
HIV POL 874 
HIV POL $89 
HIV VPU 31 
PI SSP2 165 
Pf SSP2 211 
HIV POL 152 
HIV POL 71 2 
HIV POL 952 
HBV ENV 180 
HBVENV1B1 
HBV POL 774 
HBV EN V 244 
HBV POL 406 
HBV POL 501 
HBV ENV 251 
HBV POL 637 
Pf LSA1 9 
Pf LSA1 98 
Pf LSA1 1669 
HCV Core 34 
HCV El 315 
HCV NS1/E2 506 
HCV NS1/E2 509 
HCV NS1/E2 550 
HCV N51/E2 684 
HCV NS3 1025 
HCV NS3 1134 
HCV NS3 1251 
HCV IMS3 1253 
HCV NS3 12S6 
HCV NS3 1345 
HCV NS3 1438 
HCVNS3 1567 
HCVNS3 1591 
HCV NS4 1658 
HCV NS4 1664 
HCV NS4 1767 
HCV NS4 1777 
HCV N54 1921 
HCV NS5 2456 
HCV NS5 2641 
HCV NS5 2813 
HCV NS5 2939 
HBV POL 53 
HBV POL 56B 
HBV POL 615 
HBV POL 661 
HBV POL 764 
HBV POL 824 
HIV1 ENV 333 
HIV1 ENV 637 
HIV1 GAG 333 
HIV1 POL 813 
PI LSA1 13 
Pf SSP2 61 
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WHAT IS CLAIMED IS: 

1. A composition comprising an isolated peptide that induces a CTL 
response and a T helper peptide comprising a motif of about nine residues wherein the first 
position from the N terminus of the motif is Y, F, W, L, I, V, M and the sixth position 

5 from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

2. The composition of claim 1, wherein the T helper peptide consists of 
between about 10 and about 24 residues. 

10 3. The composition of claim 1, wherein the T helper peptide is derived 

from a viral antigen. 

4. The composition of claim 3, wherein the viral antigen is from HIV, 
HBV, or HCV. 

15 

5. The composition of claim 1, wherein the T helper peptide is derived 
from a parasite. 

6. The composition of claim 5, wherein the antigen is Plasmodium 

20 falciparum. 

7. The composition of claim 1, wherein the peptide that induces a CTL 
response is linked to the T helper peptide. 

25 8. A method of inducing a CTL response in a patient, the method 

comprising contacting a cytotoxic T cell from the patient with an isolated peptide that 
induces a CTL response and a T helper peptide comprising a motif of about nine residues 
wherein the first position from the N terminus of the motif is Y, F, W, L, I, V, M and the 
sixth position from the N terminus of the motif is S, T, C, A, P, V, I, L, M. 

30 



WO 98/32456 PCT/US98/01373 

44 

9. The method of claim 8, wherein the step of contacting is carried out 
by administering to the patient a pharmaceutical composition comprising the nucleic acid 
encoding the peptide that induces a CTL response and the T helper peptide. 

10. The method of claim 8, wherein the the peptide that induces a CTL 
response is linked to the T helper peptide. 

11. A composition comprising a peptide as shown in Table VIII. 

12. A method of inducing a helper T cell response in a patient, the 
method comprising contacting a helper T cell with a peptide of claim 11. 

13. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising the peptide. 



14. The method of claim 12, wherein the step of contacting is carried 
out by administering to the patient a pharmaceutical composition comprising a nucleic acid 
encoding the peptide. 
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1. I I Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. | I Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



1. ["J As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 

claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3 . | [ As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. | x| No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention fust mentioned in the claims; it is covered by claims Nos.: 
first species 



Remark on Protest 



| | The additional search fees were accompanied by the applicant's protest. 
[ | No protest accompanied the payment of additional search fees. 
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A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

A61K 38/08, 38/10, 39/10,39/02, 39/12; C07K 7/00, 14/005, 14/20, 14/195, 14/725; C07H 21/04 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains claims directed to more than one species of the generic invention. These species are deemed 
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1. 
In order for more than one species to be searched, the appropriate additional search fees must be paid. The species are 
as follows: Each of the peptides claimed in claim 11 ( the peptides listed in Table VHI). 

The species listed above do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the same or corresponding special technical features for the following reasons: Each of the 
peptides listed in Table VIII have unique amino acid sequences. Each of the peptides of Table VHI have distinct 
biochemical structure and properties, ie ability to bind to a particular HLA molecule or elicit peptide specific antibody. 
Accordingly, the peptides disclosed in Table VIII are not so linked by a special technical feature within the meaning of 
PCT Rule 13.2 so as to form a general inventive concept. 
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